A comparative study of constrained and unconstrained approaches for segmentation of speech signal
Authors
Abstract
In this work, we compare several approaches to speech segmentation, some constrained by a phone transcript and others unconstrained. Transcript-constrained approaches such as HMM forced alignment can achieve highly accurate segmentation when the exact phone transcript is known. In practice, however, the exact phone transcript is difficult to obtain, so such approaches must make do with the canonical phone transcript. Our experiments on the TIMIT corpus demonstrate that unconstrained approaches based on an ANN and on an HMM phone loop perform better than HMM forced alignment constrained by the canonical phone transcript. Finally, we report a detailed error analysis of these approaches.
Similar resources
A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement
A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...
A Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
A Time-Frequency approach for EEG signal segmentation
The record of human brain neural activity, namely the electroencephalogram (EEG), is generally known to be a non-stationary and nonlinear signal. In many applications, it is useful to divide the EEG into segments within which the signal can be considered stationary. The combination of empirical mode decomposition (EMD) and the Hilbert transform, called the Hilbert-Huang transform (HHT), is a new and powerful ...
Comparative Performance Study of Tuned Liquid Column Ball Damper for Excessive Liquid Displacement on Response Reduction of Structure
The tuned liquid column damper (TLCD), a U-shaped tube of uniform cross-section filled with liquid, is used as a vibration response mitigation device. The tuned liquid column ball damper (TLCBD) is a modified TLCD in which the immovable orifice positioned at the middle of the horizontal portion is replaced by a metal ball. Different studies on the unconstrained optimization per...
An Adaptive Segmentation Method Using Fractal Dimension and Wavelet Transform
In analyzing a signal, especially a non-stationary signal, it is often necessary to segment the signal into small epochs. Segmentation can be performed by splitting the signal at the time instants where the signal amplitude or frequency changes. In this paper, the signal is initially decomposed into signals with different frequency bands using the wavelet transform. Then, fractal dimension of ...